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There is a wide literature on change point tests, but the case of variables with infinite variances 
is essentially unexplored. In this paper we address this problem by studying the asymptotic 
behavior of trimmed CUSUM statistics. We show that in a location model with i.i.d. errors in the 
domain of attraction of a stable law of parameter < a < 2, the appropriately trimmed CUSUM 
process converges weakly to a Brownian bridge. Thus, after moderate trimming, the classical 
method for detecting change points remains valid also for populations with infinite variance. We 
note that according to the classical theory, the partial sums of trimmed variables are generally 
not asymptotically normal and using random centering in the test statistics is crucial in the 
infinite variance case. We also show that the partial sums of truncated and trimmed random 
variables have different asymptotic behavior. Finally, we discuss resampling procedures which 
enable one to determine critical values in the case of small and moderate sample sizes. 
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1. Introduction 

In this paper we are interested in detecting a possible change in the location of indepen- 
dent observations. We observe X\, . . . , X n and want to test the no change null hypothesis 

Hq : X\,X 2 , ■ • ■ , X„ are independent, identically distributed random variables 

against the r changes alternative 



Ha-.X, 



( l<i<rai, 

ej + ci, ni < j < ri2, 

ej+c 2 , n 2 <j<n 3 , 

I ej + c r , n r < j < n. 
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It is assumed that 

ei, . . . , e„ are independent, identically distributed random variables, (1-1) 

that Co = 0, Cj 7^ Cj+i, i = 0, . . . ,r — 1, and that 1 < n\ < ri2 < • ■ ■ < n r < n are unknown. 
In our model, the changes are at time rij, 1 < j < r. Testing Hq against Ha has been 
considered by several authors. For surveys, we refer to Brodsky and Darkhovsky [7], Chen 
and Gupta [8] and Csorgo and Horvath [9] . If the observations have finite expected value, 
then the model is referred to as changes in the mean. 

Several of the most popular methods are based on the functionals of the CUSUM 
process (tied down partial sums) 

L™tJ I , I n 

3=1 3=1 

If Hq holds and < a 2 = var Xi < oo , then 

J-M n (t) V -H ] aB(t), (1.2) 

where {B(t),0 < t < 1} is a Brownian bridge. If a n is a weakly consistent estimator for 
cr, that is, a n — > a in probability, then 

^—M n {t) V ^l ] B{t). (1.3) 

Functionals of (1.3) can be used to find asymptotically distribution- free procedures to 
test Hq against Ha- The limit results in (1.2) and (1.3) have been extended in sev- 
eral directions. Due to applications in economics, finance, meteorology, environmental 
sciences and quality control, several authors have studied the properties of M n (t) and 
especially (1.3) for dependent observations. For relevant references, we refer to Horvath 
and Stcincbach [20]. The case of vector- valued dependent observations is considered in 
Horvath, Kokoszka and Steinebach [19]. We note that in the case of dependent obser- 
vations, a 2 = Um n _^oo vai^n" 1 / 2 -^7)1 so the estimation of a is considerably harder 
than in the i.i.d. case (see Bartlett [3], Grenander and Rosenblatt [13] and Parzcn [30]). 
The rate of convergence in (1.3) may be slow, so the asymptotic critical values might be 
misleading; hence, resampling methods have been advocated in Huskova [21]. With very 
few exceptions, it has been assumed that at least EX? is finite. In this paper we are 
interested in testing Hq against Ha when EX? = 00. 
We assume that 



Xi, X2, . ■ ■ belong to the domain of attraction of a stable random variable £ Q ,^ ^, 
with parameter < a < 2 \ ■ ) 



and 



Xj is symmetric when a = 1 . 



(1.5) 
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This means that 

/ n \ 

/ v ~ \ / T-l 

(1.6) 

for some numerical sequences a n and b n . The necessary and sufficient conditions for (1.6) 
are 

r > *> , ,• Pj*i < =*} n 7^ 

urn — — =p and lim — — =q (1-7) 

i^oo L(t)t- a t^oo L(t)t- a v ; 

for some numbers p > 0, g > with p + q = 1 and where L is a slowly varying function 
at oo. 

Aue et al. [2] studied the properties of M n (t) under conditions Ho, (1.4) and (1.5). 
They used maxi<j< n \Xj\ as the normalization of M n (t) and showed that 



7„ Z i<j<« 



— M„(i) M J > Yn =max\X j \. (1.8) 



Here, B a (t) = W a (t) — tW a (i) is an a-stable bridge, W a (t) is an a-stable process (see 
also Kasahara and Watanabe [22], Section 9) and Z is a random norming factor whose 
joint distribution with W a (t) is described in [2] explicitly. Nothing is known about the 
distribution of the functionals of B a (t) / Z and therefore it is nearly impossible to deter- 
mine critical values needed to construct asymptotic test procedures. Hence, resampling 
methods (bootstrap and permutation) have been tried. However, it was proven that the 
conditional distribution of the resampled M n (t) /*/„ , given Xi , . . . , X n , converges in distri- 
bution to a non-degenerate random process depending also on the trajectory (X\, X%, . . .). 
So, resampling cannot be recommended to obtain asymptotic critical values. This result 
was obtained by Aue et al. [2] for permutation resampling and by Athreya [1], Hall [18] 
and Berkes et al. [4] for the bootstrap. No efficient procedure has been found to test Hq 
against Ha when EX'j = oo . 

The reason for the 'bad' behavior of the CUSUM statistics described above is the 
influence of the large elements of the sample. It is known that for i.i.d. random variables 
X\,X%, ... in the domain of attraction of a non- normal stable law, the jth largest element 
of . . . , | X n \ has, for any fixed j, the same order of magnitude as the sum S n = 
X\ + ■ ■ ■ + X n as n — > oo. Thus, the influence of the large elements in the CUSUM 
functional does not become negligible as n — > oo and, consequently, the limiting behavior 
of the CUSUM statistics along different trajectories (Xi, X2, ■ ■ ■) is different, rendering 
this statistics impractical for statistical inference. The natural remedy for this trouble 
is trimming, that is, removing the d(n) elements with the largest absolute values from 
the sample, where d(n) is a suitable number with d(n) — > 00, d(n)/n — > 0. This type 
of trimming is usually called modulus trimming in the literature. In another type of 
trimming, some of the largest and smallest order statistics are removed from the sample 
(see, e.g., Csorgo et al. [11, 12]). Under suitable conditions, trimming indeed leads to 
a better asymptotic behavior of partial sums (see, e.g., Mori [27-29], Mailer [25, 26], 
Csorgo et al. [10-12], Griffin and Pruitt [14, 15] and Haeusler and Mason [16, 17]). Note, 
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however, that the asymptotic properties of trimmed random variables depend strongly 
on the type of trimming used. In this paper, trimming means modulus trimming, as 
introduced above. Griffin and Pruitt [14] showed that even in the case where the Xj 
belong to the domain of attraction of a symmetric stable law with parameter < a < 2, 
the modulus trimmed partial sums need not be asymptotically normal. Theorem 1.5 
reveals the reason for this surprising fact: for non-symmetric distributions F, the center 
of the sample remains, even after modulus trimming, a non-degenerate random variable, 
and no non-random centering can lead to a central limit theorem. In contrast, a suitable 
random centering will always work and since the CUSUM functional is not affected by 
centering factors, even in the case of 'bad' partial sum behavior, the trimmed CUSUM 
functional converges to a Brownian bridge, resulting in a simple and useful change point 
test. 

To formulate our results, consider the trimmed CUSUM process 

[nt] n 

T„ (t) = ]T Xjl^XA < Vn , d } - J— L Y, X i T {\ X j\ ^ Vn.d], < t < 1, 
i=i n 3=1 

where r\ n ^d is the dth largest value among \X\\, . . . , |X n |. 
Let 

F(t) = P{X 1 <t} and H{t) = P{\X 1 \> t}. 
The (generalized) inverse (or quantile) of H is denoted i/ _1 (i). We assume that 

lim d(n)/n = (1.9) 

n— >oo 

and 

lim d(n)/ (log n) 7+e — oo with some e > 0. (1-10) 

n— >oc 

For the sake of simplicity (see Mori [27]), we also require that 

F is continuous. (1-H) 



Let 

Al = -^{H-\d/n)fd. (1.12) 
2 — a 

Our first result states the weak convergence of T n (t)/A n . 
Theorem 1.1. If H , (1.4), (1.5) and (1.9)-(1.11) hold, then 

±T n (t)^ ] B(t), (1.13) 

where {B(t),0 < t < 1} is a Brownian bridge. 
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Since A n is unknown, we need to estimate it from the sample. We will use 
Al=Y^{X j I{\X j \<r 1ntd }-X n , d f and &l = -A 2 n , 



3 = 1 



where 



1 n 

X n , d = -J2 X jH\ X A<Vn,d}- 



n 

3 = 1 



We note that A n /A n — > 1 almost surely (see Lemma 4.7). 
Theorem 1.2. If the conditions of Theorem 1.1 are satisfied, then 



^T n {t)^ B(t). (1.14) 



In the case of independence and < a 2 = v&r Xj < oo, we estimate a 2 by the sample 
variance. So, the comparison of (1.3) and (1.14) reveals that in case of EX 2 = oo, we still 
use the classical CUSUM procedure; only the extremes are removed from the sample. 
The finite-sample properties of tests for Hq against Ha based on (1.14) are investigated 
in Section 3. 

In the case of a given sample, it is difficult to decide if EX 2 is finite or infinite. Thus, 
for applications, it is important to establish Theorem 1.2 when EX 2 < oo. 

Theorem 1.3. If Hq, (1.9), (1.10) and EX 2 < oo are satisfied, then (1.14) holds. 

Combining Theorems 1.2 and 1.3, we see that the CUSUM-based procedures can always 
be used if the observations with the largest absolute values are removed from the sample. 

We now outline the basic idea of the proofs of Theorems 1.1 and 1.2. It was proven 
by Kiefer [23] (see Shorack and Wcllncr [33]) that r\ n ,d is close to H~ 1 (d/n) and thus it 
is natural to consider the process obtained from T n (t) by replacing r\ n _ d with H~ 1 {d/n). 
Let 

\nt\ 

V n (t) = J2( x jH\ x j\ < H-\d/n)} - EiXjl^Xj] < H-\d/n)})) 

3 = 1 

and 

|nij 

K(t) - E^l^l ^ ^} - E(X 3 I{\X 3 \ < r ln4 })). 

3 = 1 

Since V n (t) is a sum of i.i.d. random variables, the classical functional central limit 
theorem for triangular arrays easily yields the following result. 
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Theorem 1.4. If the conditions of Theorem 1.1 are satisfied, then 

±V n (t)^W(t), 

where {W(t), < t < 1} is a standard Brownian motion (Wiener process). 

In view of the closeness of r\ n ^i and H~ 1 (d/n), one would expect the asymptotic be- 
havior of V n (t)/A n and V*{t)/A n to be the same. Surprisingly, this is not the case. Let 

m(i) = J B[XiJ{|X 1 |<f}-X 1 J{|Xi|<i?- 1 (d/n)}] ! t > 0. 
Theorem 1.5. If the conditions of Theorem 1.1 are satisfied, then 
1 



— — max 

A„ Kk<n 



k 

^[XjilttXA^^-IttXAKH-^d/n)})-™^) 
3=1 



o P (l). 



By Theorem 1.5, the asymptotic properties of the partial sums of trimmed and trun- 
cated variables would be the same if n\m{ji nt d)\ = op{A n ) were true. However, this is not 
always the the following example shows. 

Example 1.1. Assume that X\ is concentrated on (0, +oo) and has a continuous density 
/ which is regularly varying at oo with exponent —(a + 1) for some < a < 2. Then, 

nm i" M) A^(o,i), 

where 

B,, 



ad 3 / 2 



nH'{H' 1 (d/n)) 

We conjecture that the centering factor nm(j] n ^i)/A n and the partial sum process 

\nt\ 

< H-\d/n)} - EiXjlUXjl < H-\d/n)})), < t < 1, 

3=1 

are asymptotically independent under the conditions of Example 1.1. Hence, by Theo- 
rem 1.5 we would have 

1 L"*J /2 — \ 1//2 

— ^(XjIUXjl < VnA c n ) W(t)+t(—^\ £, 

where {W(t),0 < t < 1} and £ are independent, W(t) is a standard Wiener process, £ is 
a standard normal random variable and c n = EX\I{\X\ \ < H~ 1 (d/n)}. 
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In view of Theorem 1.5, the normed partial sum processes of Xj/{|X,-| < 77,^} — 
Tn(r] n ^d) and Xj/{|Xj| < H~ 1 {d/n)} have the same asymptotic behavior and thus the 
same holds for the corresponding CUSUM processes. By Theorem 1.4, the CUSUM pro- 
cess of X,/{|X,-| < H^ 1 (d/n)} converges weakly to the Brownian bridge and the CUSUM 
process of < rj n d\ — rn{rj n _d) clearly remains the same if we drop the term 

m(r] nt d)- Formally, 



max 

Kk<r, 



k 



Y^XjIUXjI < Vn , d } - -]>>,/{ |X, I < r, n , d ] 



i=i 



^XjIUXjI < H-\d/n)} -Y^S,I{ .V, < H-\d/n)} 



i=i 



< 2 max 

Kk<r, 



Y}Xj{I{\Xi\ < r, n , d } - I{\Xj\ < H-\d/n)}) - m(r?„, d )] 
3=1 



(1.15) 



Thus, even though the partial sums of trimmed and truncated variables are asymptot- 
ically different due to the presence of the random centering m(r) ntd ), the asymptotic 
distributions of the CUSUM processes of the trimmed and truncated variables are the 
same. 

The proofs of the asymptotic results for Y^j=i XjI{\Xj\ < r] n ,d} in Griffin and Pruitt 
[14, 15], Mailer [25, 26], Mori [27-29] are based on classical probability theory. Csorgo 
et at [10-12] and Haeusler and Mason [16] use the weighted approximation of quantile 
processes to establish the normality of a class of trimmed partial sums. The method of 
our paper is completely different. We show in Theorem 1.5 that after a suitable ran- 
dom centering, trimmed partial sums can be replaced with truncated ones, reducing the 
problem to sums of i.i.d. random variables. 



2. Resampling methods 

Since the convergence in Theorem 1.1 can be slow, critical values in the change point test 
determined on the basis of the limit distribution may not be appropriate for small sample 
sizes. To resolve this difficulty, resampling methods can be used to simulate critical values. 
Let 

Xj = XjI{\Xj\ < r] n ^} - X riid , 1 < j < n, 

be the trimmed and centered observations. We select m elements from the set 
{xi, X2, ■ ■ ■ , x n } randomly (with or without replacement), resulting in the sample 
yi, ■ ■ ■ ,y m - If we select with replacement, the procedure is the bootstrap; if we select 
without replacement and m = n, this is the permutation method (see Huskova [21]). We 
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now define the resampled CUSUM process 

\rnt\ 



T m , n (t) =^yj- Vi- 



nt 

i=i 3=1 

We note that, conditionally on X\,Xi, . . . ,X n , the mean of yj is and its variance is 

Theorem 2.1. Assume that the conditions of Theorem 1.1 are satisfied and draw m = 
m(n) elements yi, . . .,y m from the set {xi, . . . ,x n } with or without replacement, where 

m = m{n) — > oo as n — > oo (2-1) 

and m(n) <n in case of selection without replacement. Then, for almost all realizations 
of X\ , X2 , ■ • ■ , we have 

1 TmA t) v n ] B{ t), 



e n \frn 

where {B(t),0 < t < 1} is a Brownian bridge. 

By the results of Auc et al. [2] and Bcrkcs et al. [4], if we sample from the original 
(untrimmed) observations, then the CUSUM process converges weakly to a non-Gaussian 
process containing random parameters and thus the resampling procedure is statistically 
useless. 

If we use resampling to determine critical values in the CUSUM test, we need to study 
the limit also under the the alternative since in a practical situation we do not know 
which of Hq or Ha is valid. As before, we assume that the error terms {e^} arc in the 
domain of attraction of a stable law, that is, 

,. P{e 1 >t\ , ,. P{e l <-t\ 

lim — \\ '-=p and lim \ , ~ J -=q, (2.2) 

t^oo L(t)t- a t->oo L(t)t- a K ' 

where p>0, q>0, p + q = 1 and L is a slowly varying function at 00. 

Theorem 2.2. If H A , (1.1), (1.9)-(1.11), (2.1) and (2.2) hold, then for almost all 
realizations of X\, X2, ■ • • , we have that 

1 T m , n (t) V -H ] B(t), 



where {B(t),0 < t < 1} is a Brownian bridge 



In other words, the limiting distribution of the trimmed CUSUM process is the same 
under Hq and Ha, and thus the critical values determined by resampling will always work. 
On the other hand, under H A , the test statistic sup 0<t<1 \T n (t)\/A n goes to infinity, so 
using the critical values determined by resampling, we get a consistent test. 

We note that Theorems 2.1 and 2.2 remain true if (1.6) is replaced with EXj < 00. 
The proofs are similar to that of Theorem 2.1 but much simpler, so no details are given. 
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3. Simulation study 

Consider the model under Hq with i.i.d. random variables Xj, j = 1, . . . ,n, having dis- 
tribution function 

JP( .s_\q{l-t)- 1 - h fort<0, 
[) \l-p(l + <)- 15 fori>0, 

where p > 0, q > and p + q = 1. We trim the samples using d(n) = [n°- 3 \ . To 
simulate the critical values, we generate TV = 10 5 Monte Carlo simulations for each 
n e {100, 200, 400, 800} according to the model under the no change hypothesis and calcu- 
late the values of sup 0<t<1 \T n (t)\/(a n y/n), where T n (t) and a n are defined in Section 1. 
The computation of the empirical quantiles yields the estimated critical values. Table 1 
summarizes the results for p = q = 1/2 and 1 — a = 0.95. 

Figure 1 shows the empirical power of the test of Ho against Ha based on the statistic 
sup 0<t<1 \T n (t)\/(cr n y/n) for a single change at time k = m G {n/4,n/2} and each c\ & 
{— 3, — 2.9, . . . , 2.9, 3} for the same trimming as above (d(n) = [n°- 3 \) and a significance 
level of 1 — a = 0.95, where the number of repetitions is N — 10 4 . Note that depending on 

Table 1. Simulated critical values of 



su Po<t<i |T„(t)|/(a nV / n) for 1 - a 


= 0.95 




n = 100 n = 200 n = 400 


n = 800 


n = oo 


1.244 1.272 1.299 


1.312 


1.358 



d(n) = n 3 , k = n/4 d(n) = n 3 , k = n/2 




-3-2-10123 -3-2-10123 



Figure 1. Empirical power curves with a = 0.05, n — 100 (solid), n = 200 (dashed) and n = 400 
(dotted). 
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the sample size, we used different simulated quantiles (see Table 1). The power behaves 
best for a change point in the middle of the observation period (k = n/2). Due to the 
differences between the simulated and asymptotic critical values in Table 1, especially 
for small n, the test based on the asymptotic critical values tends to be conservative. 

4. Proofs 

Throughout this section we assume that Hq holds. Clearly 

H(x) = l-F(x)+F(-x), x>0, 
and by (1.7), we have that 

H- 1 (t)=t- 1/a K(t), if t < t , (4.1) 
where K(t) is a slowly varying function at 0. We also use 

d = d(n) -too. (4.2) 

Lemma 4.1. If H , (1.4), (1.5), (1.9) and (4.2) hold, then 

lim -L V arK(l) = l (4.3) 

n— too J\ z 

and 

n 

lim Y.ElXjlUXjl <H-\d/n)}-E[X I{\X 3 \ < ^(d/n)}]] 4 
1 a 



n—?oQ 1 

3=1 



(4.4) 



d(i7-!(d/n)) 4 4- a 
Proof. If 1 < a < 2, then 

lim EX 1 I{\X 1 \<H- 1 (d/n)} = EX 1 . 

71— >OG 

If a = 1, then by the assumed symmetry, EXiI{\Xi\ < H~ 1 (d/n)} = 0. In the case 
< a < 1 , we write 



E\X 1 \I{\X 1 \<H- 1 (d/n)}= / \x\dF(x) 

J-H- 1 (d/n) 



(d/r 
H^ 1 (d/n) 







xdH(x) 

H- 1 (d/n) 



-xH(x)\ H -i Wn ) + / 
Jo 



H(x)dx. 
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By Bingham et al. [6], page 26, 

y^> (l/(l-a))y 1 -"L(y) 

and therefore 



»^oo {a/{l-a))H- x {d/n)d/n 



Similarly, 



EXfJIIXil < fT _1 (d/n)} 
= / cc 2 dF(x) 

J-H- 1 (d/n) 

H- 1 (d/n) r-H- 1 (d/n) 

x 2 dH(x) = -x 2 H(x)\ H -i {d/n) + 2 / xiJ(a;)dx. 

Again using [6], page 26, we conclude that 

EXfI{\X!\ < H-^d/n)} _ a 



n^SL {H- l (d/n)) 2 d/n 2- a 



Hence, (4.3) is established. 
Arguing as above, we get 



rH-Ud/n) 

EX^I{\X 1 \<H" 1 {d/n)} = - / x 4 dH(x) 

Jo 



f-H-^d/n) 

= -x 4 H(x)\ H -i {d/n) +4 I x 3 H(x)dx 



and therefore 

EXtl{\X 1 \<H- 1 (d/n)} a 
hm = . 

»^oo (H-i-id/n^d/n A- a 

Similarly, 

.. E\Xx\H{\Xi\<H- x {d/n)} 
lim - 



{H-^d/njfd/n 3- a 

completing the proof of (4.4). □ 



Asymptotics of trimmed CUSUM statistics 1355 

Proof of Theorem 1.4. Clearly, for each n, XjI{\Xj\ < H~ 1 (d/n)}, l<j< n, are 
independent and identically distributed random variables. By Lemma 4.1, we have 



i EU ElXiHlXjl < H- 1 (d/n)} - EjXjIjlX,] < H-\d/n)W = 
{Yr i=1 ™{X 3 I{\XA<H-^d/n)})f 



so the Lyapunov condition is satisfied. Hence, the result follows immediately from Sko- 
rokhod [34]. □ 

A series of lemmas is needed to establish Theorem 1.5. Let rj n: i > r) n> 2 > • • • > r)n,n 
denote the order statistics of |Xl|, . . . , \X n \, starting with the largest value. 

Lemma 4.2. If Hq and (1.11) hold, then 

{H(r) n< k), l<k<n} = {S k /S n+1 ,l <k<n}, 

where 

Sk = ei-\ he fc , 1 < k < n, 

and ei, e2, . . • , e„+i are independent, identically distributed exponential random variables 
with Eej = 1 . 

Proof. The representation in Lemma 4.2 is well known (see, e.g., Shorack and Wellner 
[33], page 335). □ 

Let rj nt d{j) denote the dth largest among \X\\, . . . , |X,_i|, [Xj+i], . . . , \X n \. 

Lemma 4.3. If H , (14), (1.5), (1.9), (1.11) and (2.1) hold, then 

< Vn.A - < Vn,dU)})\=op(A n ). 

i=i 

Proof. First, we note that r) n ^(j) = ?7n,d or r/ ntd (j) = 77^+1 • Hence, 

H(Vn,d) > H (Vn,d) 



H(ji nA {j)) H(ri ntd+1 )' 
By Lemma 4.2 and the law of large numbers, we have 
H(Vn,d) v S d S d 1 



H{Vn.d+i) S d +i Sd + e-d+i 1 + ed+i/Sd 
Furthermore, by the central limit theorem, we conclude that 

Sr^ril + Opir- 1 / 2 )) 



l + Opid- 1 ). 
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and thus 

H{ Vn4 ) = -{l + P {d- 1 / 2 )). 
n 

Hence, for every e > 0, there is a constant C = C(e) and an event A = A(e) such that 
P(A) > 1 — e, and on A, 

H( Vn , d ) C 
H(jln,d+i) a 

and 

H(w)>^(l-^)- (4-6) 

We note that H{\Xj\) is uniformly distributed on [0, 1] and is independent of i]n,d{j)- So, 
using (4.5) and (4.6), we obtain that 

E[\X 3 (I{\X 3 \ < r/ n>d } - I{\Xj\ < VnAj)})\HA}} 
= E[\X j \I{r ]n4 {j)<\X j \<r }n>d }I{A}] 

< H- 1 (- (l ^))E[I{H(r, n , d ) < H(\XA) < H( Vn , d (j))}I{A}} 



Vd 



< h- 1 (- fi ^YeAh^U)) fi - ? ) 



n\ y/d 

<H I —II — ] £tf < fi"" 1 - 1 - 



vW/ ' ' rf \«\ ^/djjn + ld 

since H(rj n>d (j)) < H(ri n4+1 ) and, by Lemma 4.2, we have EH(r] nid+ i) = (d+l)/(n + l). 
The slow variation and monotonicity of if -1 yield 

Um ^((y(i-c/Vd) )=l 

n->oo L (d/n) 

and thus we get that 

n 

lim —^^^(/{l^-l < r/„,4 < VnAj)})\I{A}=0. 



Since we can choose e > as small as we wish, Lemma 4.3 is proved. □ 
Lemma 4.4. If the conditions of Lemma 4-3 are satisfied, then 

1 " 

-J-Y] \ m (Vn,d) -m(Vn,dti))\ =Op(l). 
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Proof. This can be proven along the lines of the proof of Lemma 4.3. □ 
Let 

& = Xj(I{\Xj\ < VnAj)} - < H- x (n/d)}) - m( W (j)). 

Lemma 4.5. If the conditions of Theorem 1.1 are satisfied, then there is an a > such 
that for all r > 1/a and < e < 1/2 , 

E£j = 0, (4.7) 
E$ =Eg=0((H- 1 (d/n)f(d 1/2+£ /n)+n 2r exp(-ad 2£ )), (4.8) 
EUj =Ete 2 =0((H- 1 (d/n)f(d 1 / 2+3 yn 2 )+n 2T cxp(-ad 2s )) (4.9) 
for 1 < j ' < n and l<i<j<n, respectively. 
Proof. It follows from the independence of Xj and rj nyd (j) that 

E£j = E(E(^\ Vntd (j))) = E(m( Vn , d (j)) - m( W (j))) = 0, 

so (4.7) is proved. 

The first relation in (4.8) is clear. For the second part, we note that 

Eg < 2EXf(I{\X 1 \ < VnA 1 )} - I{\Xi\ < H^(d/n)}) 2 + 2Em 2 (r ]n . d (l)) 

and 

EXfmiX^ < r) n , d (l)} - I{\X t \ < H-\d/n)}) 2 

< EX 2 I{i ln4 {l) <\Xt\< H-\d/n)} + EXfl{H-\d/n) <\X t \< ^(1)} 

< (i/- 1 (d/n)) 2 P{r 7 „, d (l) <\X,\< H-\d/n)} 

+ E{{ nn4 {l)) 2 I{H{r, n4 {\)) < HQXd) < d/n}). 

There are constants c\ and ci such that 

P{\S d -d\>xy/d}<cxp(~cix 2 ) if < x < c 2 d. (4.10) 
Let < e < 1/2. Using Lemma 4.2 and (4.10), there is a constant C3 such that 

P{A)>l-c 3 cM-ad 2£ ), (4.11) 

where 
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Let A c denote the complement of A. By (4.11), we have 

(fr- 1 (d/n)) 2 P{^ n , < ,(l) < < H-\d/n)} 

= (H-\d/n)) 2 (P(A c ) + <\Xt\< H-\d/n) : A}) 

< (H-\d/n)j 2 (c 3 eM-cid 2£ ) + p{~ <H{\Xi\) < - f 1 



: O ({H- l (d/n)f (exp(- Cl d 2e ) + ^-^j ) 



Similarly, by the independence of and r/ n ,d(l)) we have 

E{{ nn4 {l)) 2 I{H{ Vn4 {\)) < HQX^) < d/n}) 
<E{r& >d (l)/{ A c }) 

+ ^((ff-^dMl - #- 1/2 ))) 2 /{d/n(l - d £ ~ 1/2 ) < < d/n}) 

= S(^ id (l)/{^ c }) + (if- 1 (d/n(l-d £ - 1 / 2 ))) 2 ^ £ - 1 / 2 . 

Since if _1 (i) is a regularly varying function at with index — 1/a, for any r > 1/a, there 
is a constant C4 such that 

H^(t) <c 4 t~ T , 0<t<l. (4.12) 
By the Cauchy-Schwarz inequality, we have 

£< d (l)I{A C } < (i?^ d (l)) 1 /2( P (^))V2 < (Sr? 4 d(1)) l/2 c l/2 exp ^| d 2 £ 

Next, we use (4.12) and Lemma 4.2 to conclude that 

\ 4 771 / 1 "n+l — "3d 



E v 4 n Jl)<E v i d <ciE(-^) =c\E[\ 



S n +i J \ Sd 



< c 5 ( 1 + E(S n+1 - S d ) AT E^ ) < c 6 n AT 



(4.13) 



since has a Gamma distribution with parameter d and therefore -ES^ 4t < 00 if d > 
do(r). Thus, we have that 

EXl{I{\X x \ < T] n , d (l)} - I{\Xx\ < H-\d/n)}f 

= O (^(H- 1 (d/n)) 2 (d e+1 / 2 /n) + n 2T exp (~jd 2e 
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Similar arguments give 



Em 2 ( Vn , d (l)) =o( {H-\d/n)f{d s+1 ' 2 /n) + 



n 2-exp( -^-d 2e 



The proof of (4.8) is now complete. 

The first relation of (4.9) is trivial. To prove the second part, we introduce 77,^(1, 2), 
the dth. largest among |X 3 |, \X±\, . . . , \X n \. Set 

6,2 = Xi(H\Xi\ < VnA 1 , 2 )} - HlXil < H-\d/n)}) - m(ry„, d (l, 2)) 

and 

6,1 = X 2 (I{\X 2 \ < ?7n,d(l,2)} - 7{|X 2 | < H-\d/n)}) - m{r, ntd {\,2))- 
Using the independence of \X 2 \ and r) n , d (l,2), we get 

£6,26,1=0. (4.14) 

Next, we observe that 

66 = X 1 (I{\X 1 \ < r, n , d (l)} - 7{|Xx| < 77„, d (l,2)}6) 

- (m(r], hd (l)) -m(77„, d (l,2)))6 

+ X 2 (7{|X 2 | < r? n , d (2)} - 7{|X 2 | < ^(l^)})^ 

- (m(7?„, d (2)) - m(r) ntd (l, 2)))6, 2 + 6,26,1- 

So, by (4.14), we have 

£66 - £(*iJ{»Jm(1,2) < 1*1 1 < r?„, d (l)}6) + E((ro(Tfe |d (l,2)) - m(7 ? „ )d (l)))6) 

+ £7(Jf 2 /{Th,« l (l, 2) < |X 2 | < ^, rf (2)}6, 2 ) + £((m(?7„ )d (l,2)) - m( ?Krf (2)))6, 2 ) 

= 0>n,l H 1" fln,4- 

It is easy to see that 

?7«.d+2 < »?n,d(l) 2 ) < ^n.dCl) < ?7n,<i 

and 

Vn,d+2 < ri n> d(l,2) < r) n>d (2) < r) n , d . 

Hence, 

77(7?„,d(l)) > H(rj nid ) v S d _ e d+ i + e d+2 



77(r?„ )d (l,2)) H(r] ntd+2 ) S d+2 S d+2 
according to Lemma 4.2. Using (4.10), we get, for any < e < 1/2, 



P{\S d+2 - (d + 2)| > tPv'dTS} < exp(-cid 2n 
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The random variables e d +\ and e^+2 are exponentially distributed with parameter 1 and 
therefore 

P{e d +i > d 2e } = P{e d+2 > d 2s } < exp(-d 2£ ). 
Thus, for any < e < 1/2, we obtain 

I H(r] ntd (l, 2)) ~ | > 1 - c 8 exp(-c 9 d ) 

and similar arguments yield 



and 



with some constants C7, c% and eg. We now define the event A as the set on which 



H(? ]n , d (l, 2)) " d 1 " 2 *' ff( W (l,2)) " dl- 

and 
hold. Clearly, 

P(A C ) <3c 8 exp(-c 9 d 2£ ). 
Using the definition of £2, we get that 

a n ,i < -B(|X 1 |/{7 ? „, d (l,2) < \X X \ < VnA 1 )} 

x |X 2 ||7{|X 2 | < 7^(2)} - J{|X 2 | < H~\n/d)}\) 
+ E\X 1 \I{ Vn , d (l,2) < \X X \ <r ?n , d (l)}|m(7 7 „, d (2))| 
< SlXiUXal/^n.^l^) < \X ± \ < nn>d {l)}I{H- l {d/n) < \X 2 \ < r? M (2)} 
+ E\X 1 \\X 2 \I{Ti n ,d(l,2) < \X X \ < W (l)}/{r?„, d (2) < |X 2 | < H-\d/n)} 
+ J E|X 1 |/{r ? „, d (l,2)<|X 1 |<r ? „, (2 (l)}|m( W (2))| 

= fln,14 + On, 1,2 + Ira, 1,3- 

Using the definition of A, we obtain that 
on.i.1 < £|XiX 2 |J{r7„ >d (l,2) < |Xi| < 7 ?rM (l)}/{£r ^d/n) < |X 2 | < r/„ jd (2)}I{A} 
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+ £|X 1 X 2 |/{r ? „, d (l,2) < iX^KrhA^i^Wn) < \X 2 \ < i ln , d (2)}I{A c } 
< E^X^I^H^l, 2)) - < HQX^) < H( Vn , d (l, 2)) 

x I{A}I{H-\d/n) < \X 2 \ < r, n , d (2)}] + E{r,l d I{A c }) 



< H- 1 



-fl- ° w 



~^2l) <#(|Xi|)<#(77„, d (l,2)) 
d / 1 \ „ d 



Again using the independence of \Xi\, \X 2 \ and r7 n ,d(l, 2), we conclude that 
£fljtf( w (l,2))(l-^) ^(IX^tf^M)) 

■"{=('- 3^)^fl*D^ 

f ,„ „nn C7 d 1 d C7 1 

= ^(^(1,2))-^---— -< 



The Cauchy-Schwarz inequality yields 



E(vl d I{A c }) < (^, d ) 1/2 (P(^ c )) 1/2 = 0(n 2T exp(-|d 2£ 
for all r > 1/a on account of (4.13). We thus conclude 

an,i,i = o(^(ff- 1 (d/n)) 2 (d 1 / 2+3 7n 2 ) +n 2 -exp^-|d ; 
Similar, but somewhat simpler, arguments imply that 

a„,i, 2 + a„,i. 3 = O ((iT^d/n)) V /a+8 7" a ) + n 2T cxp (~yd 2e 
resulting in 

a n , 1 =o((H-\d/n))\d 1 / 2+3 yn 2 ) + n 2T exp(-^d? s 



(4.15) 



Following the lines of the proof of (4.15), the same rates can be obtained for a n ,2 and 
a n ,3- □ 
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Lemma 4.6. If the conditions of Theorem 1.1 are satisfied, then 

1 



— — max 

An l<k<r, 



£fc 

3=1 



P (1). 



Proof. It is easy to see that for any 1 < i\ < £2 < n, we have 

E (lt^ = ( £2 + + & - W 2 - *i + 

< (4-^i + l)(^i 2 + n^i6)- 
Lemma 4.5 and (1.12) yield 



n 



n W -l/2+ £ +n 2r + l 



exp(-ad e )] 



and 



for all < e < 1/6. Hence, we conclude that 

A" 



E{ <c 3 {e2-h + l) — [d~ 1/2+3£ + n 2T+2 cxp(-ad 2£ )}. 



So, using an inequality of Menshov (see Billingsley [5], page 102), we get that 



E max 

\ Kk<n 



£& 

J'=l 



< c 4 (logn) 2 ^[d- 1 / 2+3£ +n 2T+2 cxp(- a d 2e )] 



< c 4 ,4 2 [(log n) 2 eT 2/7 + cxp((2r + 2) log n + 2 log log n - ad 2e )] 
= A 2 o(l) as n—> 00, 

where e — 1/14 and d = (logn) 7 with any 7 > 7, resulting in 

k 



— E\ max 

Al \ l<k<n 



= 0(1). 



Markov's inequality now completes the proof of Lemma 4.6. □ 
Proof of Theorem 1.5. This follows immediately from Lemmas 4.3, 4.4 and 4.6. □ 
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Proof of Theorem 1.1. According to (1.15), Theorems 1.4 and 1.5 imply Theo- 
rem 1.1. □ 

Lemma 4.7. If the conditions of Theorem 1.1 are satisfied, then 

— > 1 a.s. 

Proof. This is an immediate consequence of Hacusler and Mason [16]. □ 

Proof of Theorem 1.2. From Slutsky's lemma, it follows that Lemma 4.7 and Theo- 
rem 1.1 imply the result. □ 

Proof of Example 1.1. Since H'{x) = —f(x), our assumptions imply that H'(x) is 
also regularly varying at oo. By elementary results on regular variation (see, e.g., [6]), it 
follows that 



H{x) = 1 - F(x) = f X f(t) dt ~ -xf(x) 

Jx a 



as x — > oo . 



Hence, H 1 is regularly varying at and therefore the function {H 1 (t))' = 1 / H' (H 1 {t)) 
is also regularly varying at 0. Also, 

d f x 

m (x) = — / tf(t)dt = xf(x)r~jaH(x) asx-^oo 



da; _ u 

and therefore m'(i?~ 1 (t)) ~ ta. Using Lemma 4.2, the mean value theorem gives 
nm(r) nid ) ■p nm(H^ 1 {S d / S n+ i)) _ n(l(S d / S n+1 ) ~ l(d/n)) _ f S d d 



B n B n B n B n \S n +i n 

where is between S d / S n +\ and d/n, and t(£) = m(iJ _1 (i)). It follows from the central 
limit theorem for central order statistics that 

Sd ^ V -+N(0,1). (4.16) 



d 1 / 2 \S n +i n 
The regular variation of £' and (4.16) yield 

£'(£ n )/£'(d/n)^l in probability. 
The result now follows from (4.16) by observing that 

The proof of Theorem 1.3 is based on analogs of Theorems 1.4, 1.5 and Lemmas 4.3-4.7 
when EX 2 < oo. 
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Lemma 4.8. If the conditions of Theorem 1.3 are satisfied, then 

-= ^{XjlftXA < H-\d/n)} EiXJUX^ < H-\d/n)}]) V -H ] aW(t), 
i=i 

where a 2 — var X\ . 

Proof. By EX 2 < oo, we have 

ElXJilXxl < H~\d/n)} - E[XxI{\Xi\ < H~\d/n)}] - (X 1 - EX^f — > 
as n — > oo. So, using Levy's inequality [24], page 248, we get 
1 



-= max 

In Kk<n 



J2(XiH\Xi\<H-\d/n)} 

3=1 

- ElXxI^Xx] < H-^d/n)}} - (X 3 - EX X )) 



= o P (l). 



Donskcr's theorem (see [5], page 137) now implies the result. 
Lemma 4.9. If the conditions of Theorem 1.3 are satisfied, then 



□ 



and 



1 - 

-ft52\Xi(n\Xj\<rin. 

3=1 

1 " 
-i=22\m(r} n ,d 



d }-I{\X j \<r, n , d (j)})\=o P (l) 



) -m(7?„,d0'))| = o P (l). 



Proof. We adapt the proof of Lemma 4.3. We recall that A is an event satisfying (4.5), 
(4.6) and P(A) > 1 — e, where e > is an arbitrary small positive number. We also showed 
that 

EQXtfilXA < r, n , d } - I{\Xj\ < Vn , d (])W{A}) 
<H -i(<L( l C\\d+lC 



y/dj J n + l d 
for some constant C. Assumption EX 2 < oo yields 

limsupx 1 ^ 2 iJ _1 (a;) < oo 
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and therefore 
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n^oo \ n \ y/dj J n + 1 a 



for all C > 0. Thus, for all e > 0, we have 
1 " 

lim -^E E l X i( J {l^'l^ Vn,d}-H\Xj\< fh,dV)})\HA} = 0- 

V 3=1 

Since we can choose e > as small as we wish, the first result is proved. The second part 
of the lemma can be established similarly. □ 

Lemma 4.10. If the conditions of Theorem 1.3 are satisfied, then for all < e < 1/2, 

EZj=0, l<j<n, 

Et 2 = Eg = 0((H- l (d/n)fd ll2+s /n + ncxp(-ad 2E )), l<j<n, 
E&tj = = 0((H-\d/n)) 2 d 1/2+3£ /n 2 + ncxp(-ad 2e )), l<i^j<n. 

Proof. The proof of Lemma 4.5 can be repeated, only (4.12) should be replaced with 

ff _1 (t)<C* -1/2 , 0<*<1. (4.17) 

□ 

Lemma 4.11. If the conditions of Theorem 1.3 are satisfied, then 

k 



1 

—= max 

y/n l<k<n 



3=1 



= Op(l). 



Proof. Following the proof of Lemma 4.6, we get 

2 



E max 

\ Kk<r, 



E6 

3=1 



< cin(logn) 2 [d" 1/2+3e + n 3 exp(-ad 2e )] = o(n) 



as n — > oo. Markov's inequality completes the proof of Lemma 4.11. 
Lemma 4.12. If the conditions of Theorem 1.3 are satisfied, then 



(4.18) 

□ 



-= max 

In l<k<n 



< VnA - H\Xi\ < H-\d/n)}) - m( Vn , d )} 

3=1 



= P (1). 
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Proof. It follows immediately from Lemmas 4.9 and 4.11. □ 



Proof of Theorem 1.3. By Lemmas 4.8 and 4.12, we have that 
It is easy to see that 

p , _2 



n 

which completes the proof of Theorem 1.3. □ 
Proof of Theorem 2.1. We show that 

maX1 ^ 1 ^ 1 a.s. (4.19) 

By Lemma 4.7 it is enough to prove that 

maxi<j<„ \xj\ 



a.s. 



It follows from the definition of Xj that 

max \xj \ < %,„ + \X nd \ < 2%, „. 

l<j<n 

Using Kicfcr [23] (see Shorack and Wcllncr [33]), we get 

— > a.s. 

Since (4.19) holds for almost all realizations of X\,X%, . ■ . Theorem 2.1 is implied by 
Rosen [32] when we sample without replacement and by Prohorov [31] when we sample 
with replacement (bootstrap). □ 

Proof of Theorem 2.2. This can be established along the lines of the proof of Theorem 
2.1. □ 
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